
4 ' iTT/h Jv° U h T 6 7 v dataSet WUh '” =10000 ° examples and n-15 features 
for each example. You want to use multivariate linear regression to fit the 

e^donT ^ Sh ° Uld y °" Pref “ gradient descent or 1116 

a) ^optimal e ,Uatl ° n ’ since gradiem descent might be unable to find 

b) S Sr tion ’ since ft provides “ " "V 

c) Gradient descent, since it will always converge to the optimal 0. 

d) Gradient descent, since (X T X) J will be veiy slow to compute in the 
normal equation. 

5. Suppose you train a logistic classifier h 0 (x)=g(d 0 +G I x 1 +e 2 x 2 ). Suppose 
9°~~\ 6 ’ fy~°> an d 0 2 =1. Which of the following figures represents the 
decision boundary found by your classifier? 
t a) b) 






6. You are training a classification model with logistic regression. Which of 
the following statements are true? Check all that apply. 

a) Introducing regularization to the model always results in equal or 
better performance on the training set. 

b) Adding a new feature to the model always results in equal or better 
performance on the training set. 

c) Adding a new feature to the model always results in equal or better 
performance on examples not in the training set. 

d) Introducing regularization to the model always results in equal or 
better performance on examples not in the training set. 

7. Consider an A* search algorithm for which h( n) = 0. To which of the 
following search algorithms is this A* equivalent? 

a) Greedy best-first search 

b) Depth-First Search 

c) Uniform Cost Search 

d) None of the above. 


8. K-means is an iterative algorithm, and two of the following steps are 
repeatedly carried out in its inner-loop. Which two? 

a) The cluster assignment step, where the parameters CP are updated. 

b) Move the cluster centroids, where the centroids are updated. 

c) Feature scaling, to ensure each feature is on a comparable scale to 
the others. 

d) Using the elbow method to choose K. 


1 . Consider the following map (not drawn to scale). 



Use the A* algorithm to work out a route from town A to town M. Use the following 

cost functions. , , , 

> G(n) = The cost of each move as the distance between each town (shown on 

> HOa) - The Straight Line Distance between any town and town M. These 
distances are given in the table below. 

V G H I J kTTTmI 

~56 1 29 1 29 1 30 1 14 1 1 0 1 8 j 5 30 15 Oj 

^p7 ovi d e the search tree for your solution, showing the order in which the 

nodes were expanded and the cost at each node. You should not re-visit a 
town that you have just come from. State the route you would take and 

the cost of that route. , , , , • 

b) Assume the estimated costs by the heunsttc were replaced and shown m 

the following table 

C K ~~f— FT~ I H I I 1 j I K | L 1_mJ 

-A TK ^ V) 60 30~ 20 50 I 60 1 20 j 0_] 

whSr^n^^ and wha * would 0051 ot 

tha ' r0l “c) b C 7 omme„t on the optimality of the two A* algorithms. How do you 
account for the different routes returned. 



2. Prove that A* tree search with admissible heuristic is optimal. 

3. Consider the problem of predicting how well a student does in his second 
year of college/university, given how well they did in their first year. 
Specifically, let x be equal to the number of "A" grades (including A-. A and 
A+ grades) that a student receives in their first year of college (freshmen 
year). We would like to predict the value of y, which we define as the 
number of "A" grades they get in their second year. 

Use the following training set of a small sample of different students' 
performances. Here each row is one training example. 


X 

y 

3 

2 

1 

2 

0 

1 

4 

3 

5 

4 

3 

4 


Recall that in linear regression, the hypothesis is 

he (x) — 6 0 H- 0\x 

, and we use m to denote the number of training examples. 

a) For the training set given above, what is the value of m ? 

b) Recall the definition of the cost function What is J(0, 1)1 

•'( 00,00 = ££( M * w )- t / (i) ) 2 

i=l 

c) Suppose we set 9 0 =-l and 9 t =2 . What is h e (6)l 
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